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Synopsis 

Enzymes synthesizing the bacterial CP (capsular polysaccharide) are attractive antimicrobial targets. However, we lack 
critical information about the structure and mechanism of many of them. In an effort to reduce that gap, we have 
determined three different crystal structures of the enzyme CapE of the human pathogen Staphylococcus aureus. The 
structure reveals that CapE is a member of the SDR (short-chain dehydrogenase/ reductase) super-family of proteins. 
CapE assembles in a hexameric complex stabilized by three major contact surfaces between protein subunits. 
Turnover of substrate and/or coenzyme induces major conformational changes at the contact interface between 
protein subunits, and a displacement of the substrate-binding domain with respect to the Rossmann domain. A novel 
dynamic element that we called the latch is essential for remodelling of the protein-protein interface. Structural and 
primary sequence alignment identifies a group of SDR proteins involved in polysaccharide synthesis that share the 
two salient features of CapE: the mobile loop (latch) and a distinctive catalytic site (MxxxK). The relevance of these 
structural elements was evaluated by site-directed mutagenesis. 
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INTRODUCTION 



Staphylococcus aureus is normally a harmless commensal bac- 
terium residing in 25 % of the adult population, yet is a dangerous 
human pathogen in susceptible individuals [1,2]. S. aureus is also 
notorious for its capacity to develop resistance against antibiot- 
ics. For example, methicillin- and vancomycin-resistant strains 
of S. aureus are among the major causes of nocosomial and com- 
munity infections in the USA [3,4]. We are in urgent need of new 
targets and innovative strategies to effectively combat S. aureus 
and other dangerous pathogens [5]. 



The biosynthetic machinery that generates the CP (capsular 
polysaccharide) is absent in humans, and represents an attractive 
target to fight S. aureus [6-8]. CP forms a thick layer of carbo- 
hydrate on the cell surface conferring anti-phagocytic properties, 
and helping S. aureus to persist in the bloodstream of the infected 
host. More than 70 % of clinical isolates of S. aureus belong to 
either the CP5 or the CP8 serotypes [8,9]. 

The basic structure of CP of serotypes CP5 and CP8 
consists of alternating units of three types of monosacchar- 
ides: Af-acetyl-L-fucosamine, A^-acetyl-D-fucosamine, and N- 
acetyl-D-mannosamine uronic acid. The UDP-L-FucNAc (UDP- 
activated form of A^-acetyl-L-fucosamine) is synthesized from its 



Abbreviations used: CP capsular polysaccharide; hUDGH, human UDP-a-D-glucose dehydrogenase; SDR, short-chain dehydrogenase/reductase; SEC, size-exclusion chromatography; 
UDP-6N 3 -GlcNAc, G'-Azido-G'-deoxy-UDP- N-acetylglucosamine. 
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Atomic co-ordinates and structure factors of holo-CapE, holo-K126E and CapE with substrate analogue UDP-6N3-GlcNAc are deposited in the PDB under entry codes 3VVB, 3VVC, and 
3W1V respectively. 
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Figure 1 Biosynthetic pathway of UDP-L-FucNAc in S. aureus 

(a) Synthesis of UDP-L-FucNAc requires the sequential activity of enzymes CapE, CapF and CapG. The enzyme CapE 
is a bifunctional enzyme catalysing the 4,6-dehydration of UDP-D-GlcNAc and a subsequently C5-epimerization. CapF 
catalyses the C3-epimerization of the previous intermediate, followed by the reduction of the keto-sugar at position C4. 
CapG catalyses a C2-epimerization to yield the final product UDP-L-FucNAc. The asterisks indicate the chemical groups 
subjected to enzymatic modification. This pathway is adapted from previous mechanistic studies [12,14]. (b) Structure 
of the substrate analogue UDP-6N 3 -GlcNAc, where the hydroxyl group at C6 of the substrate is replaced by an azide 
substituent [20]. 



glucosamine precursor (UDP-D-GlcNAc). This reaction has been 
described in S. aureus and several other pathogenic bacteria pos- 
sessing CP such as Streptococcus pneumoniae and Bacteroides 
fragilis [10,1 1]. L-FucNAc is also utilized in the synthesis of the 
lipopolysaccharide of some Gram-negative bacteria [12,13]. 

The transformation of UDP-D-GlcNAc in UDP-L-FucNAc 
requires three enzymes (CapE, CapF and CapG) in S. aureus 
[12,14]. These three enzymes catalyse a total of five chemical 
reactions (Figure 1). The recently determined crystal structure of 
CapF unveiled a unique architecture composed of two distinctive 
domains [15]: a N-terminal domain belonging to the SDR (short- 
chain dehydrogenase/reductase) superfamily of proteins [16], and 
a C-terminal domain displaying a standard cupin fold [17]. A 
previous study of the homologous protein WbjB, together with 
the structural study of the enzyme CapF (just downstream of 
CapE), indicates that CapE is a bi-functional enzyme [14,15]. 
The two reactions catalysed by CapE are the C-4/C-6 dehydration 
and the C-5 epimerization of the UDP-sugar. The enzyme FlaAl 
from Helicobacter pylori displays 40 % homology to CapE and 
catalyses the same enzymatic reactions [18,19]. However, the key 
catalytic tyrosine residue of FlaAl is replaced with methionine 
in CapE, suggesting mechanistic differences between them. 

Herein we present the first crystal structure of CapE, revealing 
that this enzyme forms a stable and functional homo-hexamer. 
From the comparison of three crystal structures, we identified a 
novel and dynamic element unique to the capsular-polysaccharide 
synthesizing proteins that we called the latch. The special con- 
figuration of the active site of CapE, together with the unique 
features of the latch, exposed a distinctive group of enzymes of 
bacterial origin within the SDR superfamily sharing these two 
special elements. 



EXPERIMENTAL 

Substrate analogue 

The synthesis of UDP-6N 3 -GlcNAc (fr-Azido-G'-deoxy-UDP-N- 
acetylglucosamine) was carried out as previously described [20]. 

Protein expression and purification 

CapE with a His 6 tag at the N-terminus was expressed in Escheri- 
chia coli BL21 (DE3) cells, and purified as described previously 
[15,21]. After SEC (size-exclusion chromatography), fractions 
of CapE were dialysed in 10 mM Tris/HCl (pH 9.0), 30 mM 
NaCl and 1 mM DTT (dithiothreitol), and concentrated with a 
100 kDa Centriprep filtration unit (Millipore) prior to crystalliza- 
tion. For activity assays, protein fractions were stored at — 20 °C 
in a solution supplemented with 40% (v/v) glycerol. Protein 
concentration was determined spectrophotometrically at 280 nm 
using the calculated molar extinction coefficient of the protein 
(s = 19200 M~ 1 cm -1 ). Muteins of CapE were prepared with a 
Quick-Change kit following the instructions of the manufacturer. 
Protein expression and purification were carried out as above. 

Protein crystallization 

Crystals of CapE suitable for X-ray diffraction analysis were ob- 
tained by the hanging drop method by mixing 1 /xl of fresh protein 
solution at 8 mg/ml and 1 /xl of crystallization solution. Crys- 
tals of wild-type CapE with coenzyme bound were obtained in 
0.72 M sodium succinate (pH 7.0) and 1.4 M potassium formate. 
Rod- shaped crystals grew to an approximate size of 
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Figure 2 Crystal structure of CapE 

Structure of (a) CapE with substrate analogue UDP-6N 3 -GlcNAc bound, and (b) K126E mutein with coenzyme NADPH bound. 
The coenzyme-binding domain, substrate-binding domain, and latch are coloured in blue, yellow and orange, respectively. 
The ligands are depicted as spheres with CPK colours. The figure was prepared with CHIMERA [50]. (c) Primary and 
secondary structure of CapE. The underlined sequence (red line) corresponds to the disordered region in K126E. 



30x30x 100 /xm 3 within 1 week. Crystals were transferred to 
a cryoprotectant solution consisting of mother liquor supplemen- 
ted with 25 % (v/v) glycerol, plunged into liquid N 2 and stored 
until data collection. 

Crystals of CapE with substrate analogue were obtained by the 
co-crystallization method using 100 /xM UDP-6N 3 -GlcNAc. Ex- 
tensive screening with an Oryx8 robot (Douglas Instruments) [22] 
yielded a suitable solution composed of 100 mM Hepes/NaOH 
(pH7.5) and 1.5 M Li 2 S0 4 . For freezing, single crystals were 
passed through a small drop of paratone (Hampton Research) and 
plunged in liquid N 2 . Crystals of mutein K126E were obtained 
in 100 mM Hepes (pH7.5), 2% (v/v) PEG400 and 2.0 M am- 
monium sulfate. Suitable crystals were soaked in mother liquor 
supplemented with 25 % (v/v) glycerol and 500 /xM substrate (the 
substrate was not observed in the electron density) and stored in 
liquid N 2 . 

Data collection and refinement 

Suitable crystals of CapE were mounted under a stream of cold ni- 
trogen (100 K) at beamlines BL5A and AR-NE3A of the Photon 
Factory (Tsukuba, Japan). Data were processed with the program 
MOSFLM [23] and merged and scaled using the program SCALA 



of the CCP4 program suite [24]. The structure of mutein K126E 
was determined by the method of molecular replacement with 
the program PHASER [25,26] using the coordinates of FlaAl of 
H. pylori (PDB entry code 2gn4) as the search model. Coordin- 
ate refinement was carried out with PHENIX [27], REFMAC5 
[28] and COOT [29]. Data were further refined with REFMAC5 
using TLS parameterization. Each protomer was split in the three 
groups shown in Figure 2 of the Results and Discussion section 
[30]. The structure of the apo-CapE and apo-K126E were determ- 
ined by molecular replacement using the structure of holo-CapE. 
Refinement was performed as above. Model quality was assessed 
with PROCHECK [31]. Data collection and refinement statistics 
are summarized in Table 1 . 



Enzymatic assay 

Enzymatic activity of CapE was monitored by the method de- 
scribed in Miyafusa et al. [15]. In a typical assay, substrate UDP- 
D-GlcNAc (Wako) at 200 /xM was mixed with 2 /xM CapE. The 
assay buffer was composed of 20 mM Tris/HCl (pH 8.0). Total 
volume was 100 /xl. Assay mixtures were incubated at 37 °C for 
2 h, after which the reaction was stopped by addition of 100 /xl of 
ice-cold phenol/chloroform/isoamyl alcohol at a 25:24:1 molar 
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Table 1 Data collection and refinement statistics 

Values in parenthesis correspond to the highest resolution bin. 



Wild-type 



Properties 



UDP-6N 3 -GlcNAc 



NADPH 



K126E 
NADPH 



Data collection 
Space group 
Unit cell 

Dimensions (A) 

Angles (°) 
Wavelength (A) 
Resolution range (A) 
Total observations 
Unique observations 
l/a (I) 

Completeness (%) 

Emerge (%)* 

Multiplicity 
Refinement 

Rwork/Rfree (%)t 

Number of protein chains 
Number of protein residues 
Number of protein atoms 
Number of ligands 
Number of ligand atoms 
Number of water molecules 
B-factor, protein (A 2 ) 
B-factor, ligands (A 2 ) 
B-factor, water (A 2 ) 
RMSD bonds (A) 
RMSD bonds (°) 
Co-ordinate error (A) 
Ramachandran plot 
Preferred regions (%) 
Allowed regions (%) 
Outliers (%) 
PDB codes 



P321 

a = b = 123.78, c = 104.46 
a = f3 = 90, y = 120 

I. 0000 
34.8-2.10 
291899 
54166 

II. 0 (4.1) 
99.9 (100) 
10.4 (33.8) 
5.4 (5.2) 

16.4/20.7 

2 

693 

5562 

18f 

168 

385 

29.0 

53.3 

36.0 

0.023 

2.19 

0.14 

97.5 

2.2 

0.3 

3W1V 



P6 3 22 

a = b = 125.18, c = 101.79 
a = f3 = 90, y = 120 

I. 0000 
34.05-2.80 
103149 
12071 
12.2 (2.8) 
99.9 (100) 

II. 9 (79.8) 
8.5 (8.9) 

23.8/27.6 
1 

262 

2137 

1 

48 
3 

81.9 

80.4 

49.2 

0.008 

1.363 

0.62 

92.7 

7.3 

0.0 

3VVB 



H3 2 

a = b = 160.5, c = 80.05 

a = f3 = 90, y = 120 

1.0000 

40.13-2.20 

93553 

20104 

8.9 (2.5) 

99.9 (100) 

8.8 (49.8) 

4.7 (4.4) 

19.6/26.4 
1 

314 

2504 

6 

73 

82 

50.1 

76.8 

46.5 

0.017 

2.01 

0.24 

95.9 

4.1 

0.0 

3VVC 



*Rmerge = S hk | S/|/(hkl)/ - [/(hkl)]|/E hW S//(hkl). 

-fR work = Ehki |F(hkl) 0 - [F(hkl) c ]|/Ehk|F(hkl) 0 ; Rfree was calculated as R wor k> where F(hkl) 0 values were taken from 5% of data not included in the 
refinement. 

f This structure contains 14 sulfate ions from the crystallization solution. 



ratio. The supernatant containing the sugars were mixed with 
100 /xl of chloroform and analysed by HPLC using a CarboPac 
PA1 anion-exchange column (Dionex) as described previously 
[32]. The overall conversion to products was calculated from the 
consumption of substrate UDP-D-GlcNAc. 



baseline was subtracted from the protein thermogram and the 
data subsequently were normalized by protein concentration and 
adjusted to a two-stage unfolding curve [33] using the program 
ORIGIN supplied by the manufacturer. 



Differential scanning calorimetry 

The thermal stability of CapE was determined in a VP-capillary 
microcalorimeter (GE Healthcare). Proteins at a concentration 
of 1 8 /xM were equilibrated in a solution containing 50 mM 
Hepes (pH7.4) and 150 mM NaCl. Thermograms were recor- 
ded between 283 K and 373 K at a rate of 1 K min ~ 1 . The buffer 



RESULTS AND DISCUSSION 



Crystal structure of CapE 

We determined three crystals structures of CapE by X-ray diffrac- 
tion methods at resolutions 2.1-2.8 A (Table 1). Two structures 
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Figure 3 Conformation of substrate analogue and coenzyme 

(a) Stereoview of the substrate analogue UDP-6N 3 -GlcNAc. The electron density correspond the sigma-A weighted 2Fo-Fc 
map contoured at 1.0 a. The substrate analogue is depicted with CPK colours, (b) Conformation of various nucleotide-sug- 
ars bound to representative SDR enzymes. The panel depicts CapE (this study, red); CDP-/3-D-xylose bound to CDP-D-gluc- 
ose 4,6-dehydratase (lwvg, yellow); GDP-rhamnose bound to GDP-mannose 4,6-dehydratase (ln7g, cyan); dTDP-D-glucose 
bound to dTDP-glucose 4,6-dehydratase (D128N/E129Q mutein, lr6d, pink); and UDP-GlcNAc bound to FlaAl (UDP-GlcNAc 
inverting 4,6-dehydratase, 2gn4, green), (c) Residue environment around the substrate analogue in wild-type CapE and (d) 
in the binding pocket of the coenzyme in K126E. Panels (c) and (d) were generated with the program LIGPLOT [51]. 



correspond to the binary complex of the enzyme with coenzyme 
(wild-type protein and inactive mutein K126E). The third struc- 
ture corresponds to wild-type protein (coenzyme-free) in com- 
plex with the substrate analogue UDP-6N 3 -GlcNAc. 

A three-dimensional homology search in the DALI server [34] 
indicates that CapE is a member of the extended SDR superfamily 
[35]. As often seen in this group of enzymes, CapE shares little or 
very little similarity at the primary sequence level with the other 
members of this family (10-25% identity). The only exception 
is FlaAl of Helicobacter pylori (40 % identity), which is also the 
closest structural homolog of CapE in the protein data bank (PDB 
entry code 2gn4; rmsd = 2.0 + 0.1 A) .We note that the enzyme 
following CapE in the biosynthetic route of UDP-D-FucNAc in 
S. aureus, CapF, has the lowest similarity score among the first 
950 structural hits found with the program DALI [34]. 

CapE consists of three well-defined regions: a Rossmann 
domain for binding the coenzyme NADPH, a substrate binding 
domain, and a 23 -residue loop peripheral to the protein core 
(Figures 2 A and 2B). The first two regions are well-conserved 
elements of the SDR family of enzymes [35]. Segments 
corresponding to each domain are interspersed along the primary 
sequence of the enzyme (Figure 2C). The loop comprising 



residues 287-309 constitutes a novel structural element in 
this family of proteins. We called this loop the latch because 
it engages two molecules of CapE in the crystallographic 
dimer. The latch is not observed in the two crystal structures 
of CapE with coenzyme bound because of dynamic disorder. 
In contrast, the latch is clearly observed in the structure 
with substrate analogue bound (see below). We note that the 
tertiary structure of CapE and FlaAl differs significantly from 
each other at the latch region (Supplementary Figure SI at 
http://www.bioscirep.org/bsr/033/bsr033e043add.htm).Whereas 
the latch connects two protomers of CapE of the hexameric 
complex, the equivalent loop of FlaAl (residues 292-318) folds 
into its own protein chain. 

The substrate-binding domain consists of residues 168— 
199, 227-260 and 310-337. The substrate- analogue UDP-6N3- 
GlcNAc is clearly visible in the electron density map (Fig- 
ure 3A). The electron density of the azide moiety is weaker 
than that in other sections of the substrate analogue, prob- 
ably because of radiation damage at this functional group. 
Although the sugar ring of the substrate analogue protrudes 
into the coenzyme domain (where the active site is located), 
it occupies a non-catalytic conformation compared with that 
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of other representative SDR enzymes (Figure 3B). The con- 
formation of UDP-6N 3 -GlcNAc bound to CapE is similar to 
that of the inhibitor UDP-D-GalNAc bound to the homologous 
enzyme FlaAl [18] (Supplementary Figure SI). The substrate 
analogue engages residues of the binding pocket through mul- 
tiple polar interactions (Figure 3C and Supplementary Table SI 
at http://www.bioscirep.org/bsr/033/bsr033e043add.htm). In par- 
ticular, the UDP moiety of the substrate analogue establishes nu- 
merous H-bond and electrostatic interactions with polar residues 
of the protein. 

The coenzyme-binding region is the largest functional ele- 
ment of CapE. This domain adopts a classical Rossmann 
fold [16,35], comprising residues 1-167, 200-226 and 261- 
286. Although NADPH was not added during protein purific- 
ation, or during the crystallization trials, it is clearly visible in 
the crystal structure of wild-type and K126E, suggesting that the 
cofactor binds the enzyme tightly (Supplementary Figure S2 
at http://www.bioscirep.org/bsr/033/bsr033e043add.htm). A de- 
tailed view of the local environment around the coenzyme is de- 
picted in Figure 3(D). The coenzyme engages in a dense network 
of non-covalent interactions with the enzyme, including polar 
and van der Waals forces, and hydrophobic interactions. Unex- 
pectedly, the coenzyme moiety is not observed in the structure 
of CapE in complex with substrate analogue. This is surprising 
because the superposition of the two types of CapE complexes 
does not suggest direct clashes between the substrate analogue 
and the coenzyme moiety. It is possible that the departure of 
the coenzyme is stimulated by a kinetic mechanism similar to 
that of SDR proteins ArnA and UDP-xylose synthase [36]. Al- 
ternatively, it is conceivable that the substrate analogue binds to 
CapE in a non-native conformation that is incompatible with the 
binding of the coenzyme. 

CapE is a functional hexamer 

Analysis of the crystallographic symmetry reveals that CapE is 
a homo-hexamer in the crystal form (Figure 4). The protein as- 
sembles as a trimer of dimers (3x2). The hexameric organiza- 
tion of CapE constitutes a rare example within the SDR fam- 
ily. This organization is only shared by FlaAl, which is the 
SDR enzyme with the highest structural homology to CapE 
[18]. SEC indicates that the hexameric complex is the pre- 
dominant species in solution (Figure 4B). The hexamer co- 
exists with a small fraction of monomer (~8%), but not di- 
mer. Moreover, CapE undergoes two- state thermal unfolding 
without intermediates, suggesting that the hexamer is the most 
stable form of CapE in solution (Supplementary Figure S3 at 
http://www.bioscirep.org/bsr/033/bsr033e043add.htm) [37] . 

The evaluation of the contact interface with the PISA server 
[38] indicates that the hexamer of CapE (with substrate analogue 
bound) buries nearly 24000 A 2 of surface area upon oligomeriza- 
tion, i.e. 24 % of the total solvent-exposed surface. Each protomer 
of CapE interacts with adjacent molecules through three major 
interfaces (Figure 4C). The largest interface, between molecules 
A and B, buries 5016 A 2 and involves 122 residues (28 % of total) 
(Table 2). This interface corresponds to the typical dimerization 



interface observed in numerous SDR enzymes [39,40]. The in- 
terfaces A-F and A-C are smaller, burying 2044 A 2 and 934 A 2 , 
respectively. 

The A-B and A-C surface areas of CapE in complex 
with coenzyme are significantly smaller than that of CapE 
with substrate analogue bound. The A-B and A-C surface 
areas calculated from the coordinates of CapE with coen- 
zyme bound are reduced by 60 and 65%, respectively (Fig- 
ure 4C, Table 2). The reason is the dynamic disorder oc- 
curring at these two contact regions. In fact, the number of 
residues observed at these two interfaces is greatly diminished 
(Table 2). On the contrary, the A-F interaction surface remains 
essentially unchanged regardless of the ligand present in the 
crystal. The surface area, number of residues and number of 
non-covalent interactions in the A-F region remains constant 
among the three crystals structures (Supplementary Table S2 at 
http://www.bioscirep.org^sr/033/bsr033e043add.htm). hUDGH 
(human UDP-a-D-glucose dehydrogenase) is a well-studied 
hexameric enzyme showing large structural changes during 
its catalytic cycle [41,42]. The hexamer-building interface of 
hUDGH is quite flexible and therefore the hexamer readily dis- 
sociates into dimers and tetramers. On the contrary, the hexamer 
of CapE seems more stable and remains together even after the 
disruptive effects described above, or after large conformational 
changes (see below). We cannot rule out that some of the con- 
formational changes are influenced by crystal packing forces, and 
therefore may not be of functional significance. 

Conformational changes 

The Rossmann domain of CapE is located at the centre of the 
hexamer and mediates most of the inter-chain contacts, whereas 
the substrate -binding domain is found at the periphery of the 
complex (Figure 5). This arrangement could facilitate conforma- 
tional changes during the catalytic cycle at the substrate-binding 
domain, since this domain is more exposed to the solvent. 

The crystals structures of CapE with analogue bound or with 
coenzyme bound were compared with the program DYNDOM 
[43] to quantify their conformational changes. The structure of 
K126E with coenzyme bound was chosen over the equivalent 
structure of wild-type CapE because of the better resolution 
achieved with the mutated protein (Table 1). The analysis with 
DYNDOM indicates that the substrate-binding domain rotates 
11° and moves a few angstroms towards the coenzyme-binding 
domain upon exit of the coenzyme and binding of substrate ana- 
logue (Figure 5B). The largest shift occurs at residues Gly 256 
and Gly 252 (6.0 A).Although these observations must be under- 
stood within the context of the crystal structure, they are sug- 
gestive of a significant mobility at the substrate domain during 
catalysis. 

Another important dynamic element is the latch (residues 
287-309, Figure 6). The latch occupies a position between 
that of the substrate-binding and Rossmann domains (Fig- 
ure 5A). The latch is disordered in the two crystal structures 
with coenzyme bound (Figure 6B and Supplementary Fig- 
ure S4 at http://www.bioscirep.org/bsr/033/bsr033e043add.htm). 
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Figure 4 Hexameric organization of CapE 

(a) CapE forms a hexamer organized as a trimer of dimers (3x2). The structure corresponds to wild-type CapE in complex 
with substrate analogue (ligand is not shown). The protein subunits of one of the pseudo-dimers are depicted in dark and 
light blue (chains A and B). The other subunits are depicted in black and grey, (b) SEC profile of CapE. The arrows indicate 
the elution of the calibration standards. CapE elutes with an apparent molecular weight of 210 kDa. A minor fraction of 
monomer (8%) with an apparent molecular weight of 40 kDa is also observed. The predicted molecular weight of one 
subunit of CapE is 39 kDa, and that of the hexamer is 230 kDa. The peak corresponding to the dimer was not observed, 
(c) Interaction surfaces between protein subunits. The three main contact interfaces correspond to subunits A-B, A-F and 
A-C. (d) Same interaction surfaces in mutein K126E. 



The latch connects two contiguous subunits of CapE, and forms 
a large fraction of the A-B and A-C interface of the hexamer 
with very high shape complementarity (Sq = 0.79 [44]) (Fig- 
ure 6C and Supplementary Figure S5 at http://www.bioscirep.org/ 
bsr/033/bsr033e043add.htm). Together with the residues of 
neighbouring subunits of CapE, the latch contributes >3500 A 2 
of buried surface area, including 28 H-bonds. The interaction 
between the latch and the adjacent protomer is also held together 
by contacts involving four large hydrophobic residues: Leu 288 , 
Tyr 290 , Tyr 293 and He 301 . In addition, the latch occupies a stra- 
tegic position at the entrance of the substrate-binding pocket, 



potentially regulating the access of the substrate to the active site 
(Figure 6C). 

A search of tertiary structure homologues of the latch with 
the DALI server [34] did not find any example of this struc- 
tural element in the protein data base. For example, the SDR 
enzyme FlaAl (the closest structural homologue of CapE) also 
forms a hexamer of similar size to that of CapE, but do not 
exhibit this element (Supplementary Figure SI) [18]. On the 
contrary, the program Protein BLAST indicates that the primary 
sequence of the latch is well conserved among a variety of Gram- 
positive and Gram-negative bacteria of pathogenic potential 
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Table 2 Interaction surface area between CapE protomers 

Interface* Protein Buried area (A 2 ) Number of residues! No. H-Bond 



A-B 


WT (UDP-6N 3 -GlcNAc) 


5016 


122 


28 




WT (NADPH) 


2162 


58 


4 




K126E (NADPH) 


1988 


58 


4 


A-F 


WT (UDP-6N 3 -GlcNAc) 


2044 


58 


12 




WT (NADPH) 


2066 


58 


10 




K126E (NADPH) 


2070 


58 


12 


A-C 


WT (UDP-6N 3 -GlcNAc) 


934 


32 


6 




WT (NADPH) 


346 


14 


0 




K126E (NADPH) 


334 


14 


0 


Total 


WT (UDP-6N 3 -GlcNAc) 


7994 


212 


46 




WT (NADPH) 


4574 


130 


14 




K126E (NADPH) 


4392 


130 


16 



*Protomers of CapE as shown in Figure 4(A). 

tSome residues are counted more than once because they participate in more than one surface. 




Figure 5 Conformational changes in the substrate-binding domain 

(a) Topology of the hexamer. The coenzyme-binding domain (blue) and the substrate-binding domain (red) occupy the 
central and peripheral regions of the complex, respectively. The latch (orange) is located in an intermediate position, (b) 
Conformational change. Binding of the substrate analogue, and loss of the coenzyme, induces a rotation of 11° and a 
displacement of the substrate domain towards the Rossmann domain. Dark and light colours correspond to wild-type 
protein and K126E, respectively. The rotation angle was calculated with the program DYNDOM [43]. 



(Figure 6D). The genera of these pathogenic bacteria are Pseudo- 
monas, Acidovorax, Hyphomicrobium, Vibrio, Pasteurella, 
Fusobacterium, Listeria, Bacillus and Enterococcus (in addition 
to Staphylococcus). Serotypes of a majority of these bacteria are 
encapsulated, suggesting a direct association between this novel 
group of SDR enzymes, the production of CP and pathogenesis 
[6,12,45-49]. 

Site-directed mutagenesis supports the structural 
findings 

The coenzyme-binding domain hosts the canonical catalytic 
triad, which invariably contains the sequence YxxxK in SDR 
enzymes (except for a minor group of divergent SDR en- 
zymes, which are characterized by a YxxMxxxK motif). 
Strikingly, the corresponding sequence of CapE is altered to 
M 134 xxxK 138 without the catalytic tyrosine. Importantly, the se- 
quence alignment of the family of proteins sharing the latch 



(see Figure 6D) demonstrates an absolute conservation of 
the catalytic residues of CapE (Supplementary Figure S6 at 
http://www.bioscirep.org/bsr/033/bsr033e043add.htm). In con- 
trast, the other SDR structural homologues do invariably pos- 
sess the canonical YxxxK motif (Supplementary Figure S7 at 
http://www.bioscirep.org/bsr/033/bsr033e043add.htm). We pro- 
pose that the unique structural element that we called the latch, 
and the unique composition of the active site residues define a 
novel subfamily of polysaccharide- synthesizing enzymes within 
the SDR super-family of proteins. 

To strengthen the structural analysis we prepared sev- 
eral muteins of CapE by site-directed mutagenesis (Fig- 
ure 7 and Supplementary Table S3 at http://www.bioscirep.org/ 
bsr/033/bsr033e043add.htm). No excess coenzyme was added 
in the enzymatic assay because CapE was purified in com- 
plex with the coenzyme. The level of NADPH bound to CapE 
was essentially constant among all muteins as estimated from 
the ratio Abs260/Abs280 (Supplementary Table S3). Firstly, 
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(d) 



CapE 

Q9XC61 

A1WA85 

G4IPV8 

C9PJA1 

Q9CM37 

C3WC53 

D7UYE7 

B j' HXA 7 

K1AJQ2 



280 
281 
284 
2S1 
231 
281 
284 
286 
280 
2B0 



Large Hydrophobic Residues 

RVPADSHIILN¥SN¥VE TGKEK I TQS Y EYNHD 310 

R I PFDLHDLNiSKt'VEQGEEK ITHT EDYNSH 311 

RVPAD GRDLN Y AKF VEQGDQKLTQTTRGE DYNSH 317 

RVPPDSRDLHYEKFVEQGE RR I TEA Q DYNSN 311 

HUPPDLRDLNYSKYVEEGDABIAQI E DYNSH 311 
R I PAD Q RSLNYSK YVE KGE PK I TEVT - - - DYNSH 311 
RVPADD RDLNYSK YFEQGQE V I TQA- — EE YNSH 314 
R I PADS RDLHYSN YFE DGSEA X TEA Y EYNSE 316 

RVPAD CRDLNYDKYFEVGNKKLTN IE EYNSD 310 

RVPAD T RDLNYDKY F VEG SKELTQE G EYNSD 310 



Co-ibcrvation 



Figure 6 Analysis of the latch 

(a) Structure of the latch at the A-B dimer interface. The latch (orange) of subunit A (grey ribbons) associates with 
the substrate-binding domain of molecule B (grey surface). The substrate analogue is depicted only in subunit B (CPK 
representation), (b) Same view of K126E. The latch is not modelled because of disorder. The dotted line represents the 
hypothetical position of the latch in the same position as shown in panel (a), (c) Close view of the latch of subunit A 
interacting with the substrate-binding pocket of subunit B. The hydrophobic residues Leu 288 , Tyr 290 , Tyr 293 and lie 301 fit on 
the groves of subunit B with high surface complementarity, (d) Sequence alignment of the residues belonging to the latch. 
The panel shows the top-ten solutions found by BLAST [52]. The sequences belong to various genus of Gram-positive 
and Gram-negative pathogenic bacteria (from top to bottom: Staphylococcus, Pseudomonas, Acidovorax, Hyphomicrobium, 
Vibrio, Pasteurella, Fusobactehum, Listeria, Bacillus and Enterococcus). Each sequence is identified by their accession 
code in the UNIPROT database. 



(a) 
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(b) 



i 



Active Site 



Interface 



Substrate 
I Peak 1 



Peak 2 



No CapE 



LA. 



Peak 1 / Peak 2 
2.7±0.2 



Peak 1 / Peak 2 
0-7+0.07 



CapE 



Y293A 



12 



14 



1 1 

16 18 

Elution (minutes) 



20 



-1 

22 



Figure 7 Mutational analysis 

(a) Wild-type CapE or muteins (2 /xM) were incubated with UDP-D-GlcNAc (200 /xM) for 2 h at 37 °C. Consumption of 
substrate UDP-D-GlcNAc was monitored by HPLC as described in experimental procedures. The bars correspond to the 
average of three independent assays + S.D. (b) Representative HPLC profiles. The substrate UDP-D-GlcNAc was incubated 
with no CapE (top), with wild-type CapE (middle) and with mutein Y293A (bottom). 
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individual residues of the active site and substrate-binding pocket 
were mutated: Asp 125 , Lys 126 , Met 134 and Glu 257 were changed 
to Ala (Lys 126 was also mutated to Glu and its crystal structure 
discussed above). All these individual muteins exhibited a much- 
diminished activity compared with the wild-type enzyme. The 
muteins K126A, K126E and E257A where completely inactive, 
demonstrating the importance of these residues for the functional 
binding of the substrate. Meanwhile the activity of D125A and 
Ml 34 A was strongly impaired (4-6-fold lower conversion than 
WT protein). We note that all the muteins eluted as hexamers 
during their final step of purification by SEC. 

Secondly, we evaluated the properties of the A-B surface (es- 
sentially the latch) by mutating separately five different residues 
to Ala: Phe 91 , Tyr 290 , Tyr 293 , Tyr 305 and Tyr 307 . The last four 
muteins belong to the latch region. The mutations did not alter 
significantly the levels of conversion compared with the wild- 
type enzyme (Figure 7A). However, although the values of con- 
version of Y293A and wild-type protein are identical, the ratio 
of their enzymatic products (computed from the area under the 
HPLC peaks) is inverted in Y293A in comparison with wild- 
type CapE. The relative areas of peak-1 with respect to peak-2 
in wild-type or in mutein Y293A were 2.7 and 0.7, respectively 
(Figure 7B). Similarly, the selectivity is inverted in all muteins 
belonging to the latch, but not in F91A, a residue not belonging 
to the latch (Supplementary Table S3). Changes in the relative 
abundance of intermediates do not necessarily reflect a change of 
their thermodynamic equilibrium, but probably a slow down 
of their rate of interconversion. Our next challenge will consist in 
the identification of these unstable intermediates, which will al- 
low an in-depth discussion of the structure/function relationship 
and catalytic mechanism of CapE. We note that in contrast to the 
results obtained with the muteins belonging to the interface A-B, 
the mutein F92A of the A-F interface led to a much-reduced 
activity (<20% conversion). 

Conclusion 

We determined the first set of crystal structures of the enzyme 
CapE of S. aureus belonging to the biosynthetic pathway of CP. 
CapE forms a robust hexamer held together by three different 
interfaces, A-B, A-F and A-C. The largest A-B interface con- 
tains a mobile motif that we denominated the latch. The latch is 
a flexible element that increases the contact surface area of adja- 
cent monomers of CapE in the presence of the substrate analogue 
UDP-6N 3 -GlcNAc. The structural analysis also demonstrates a 
rotation of the substrate-binding domain when the substrate ana- 
logue binds to the enzyme. Importantly a sequence alignment of 
the latch, and of the active site residues of CapE, has identified 
a novel subfamily of SDR enzymes involved in the synthesis of 
polysaccharide of Gram-positive and Gram-negative pathogenic 
bacteria. 
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Panel (a) is compared with that of FlaAl (b) using the software PDBe fold. The conformation and orientation of the latch 
region (in this figure from Leu 288 to Thr 302 ) clearly differs from the equivalent region in FlaAl (from Gin 292 to Ala 310 ), (c) 
Structure of nucleotide-sugars bound to CapE and FlaAl. The panel depicts the substrate analogue bound to CapE (red), 
the substrate analogue UDP-D-GalNAc bound to FlaAl (PDB entry code 2gna, blue) and the substrate UDP-D-GlcNAc bound 
to FlaAl (PDB entry code 2gn4, yellow). 
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Atomic co-ordinates and structure factors of holo-CapE, holo-K126E and CapE with substrate analogue UDP-6N3-GlcNAc are deposited in the PDB under entry codes 3WB, 3WC, and 
3W1V respectively. 




FlaAl 



Figure SI The structure of the protomer of CapE 
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WT + Substrate Analog K1 26E + Coenzyme 



Figure S2 Coenzyme is absent in the binary complex with substrate analogue 

Electron density map in the cofactor binding site of (a) the binary complex of wild-type CapE with UDP-6N 3 -GlcNAc bound 
and (b) mutein K126E. The green mesh corresponds to the sigma-A weighted 2Fo-Fc electron density map contoured at a 
level of l.Ocr. In panel (a), the black sticks correspond to the superimposed coordinates of NADPH from the structure in 
panel (b). The electron density maps cover a radius of 2.0 A from any atom belonging to the coenzyme. 
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Figure S3 Thermal unfolding of CapE monitored by DSC 

CapE (18 /xM) was heated from 283 to 373 K at a rate of 1 K min " 1 . The 
thermodynamic parameters obtained from fitting the data to a two-state 
thermal transition are T M = 62.5 + 0.1 °C and AH° = 105 + 1.7 kcal 
mol -1 . 



© 2013 The Author(s) This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC-BY) (http://creativecommons.Org/licenses/by/3.0/) 
which permits unrestricted use, distribution and reproduction in any medium, provided the original work is properly cited. 



Crystal structure of CapE 



Original Paper 



fa) 



Latch 




(b) 



Coenzyme 
domain 



Substtrate 
domain) 




Of 

Latch ^ 

(Coodinates of Latch belong to 
WT + Substrate Analog) 



Coenzyme 
domain 



Subsftrate 
domain 



WT + Substrate Analog 



K126E + Coenzyme 



Figure S4 The latch is disordered in the apo form of CapE 

Comparison of the electron density maps in the region of the latch in (a) CapE with substrate-analogue bound and 
(b) K126E with coenzyme bound. The green mesh corresponds to the sigma-A weighted 2Fo-Fc electron density map 
contoured at a level of 1.5 a. For comparative purposes, the structure of the latch shown in panel (b) corresponds to that 
in CapE with substrate analogue. The electron density maps cover a radius of 2.0 A from any atom belonging to the latch. 
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Figure S5 The latch contacts multiple subunits of CapE 

Surface-representation of the latch region in the structure of (a) CapE in complex with UDP-6N 3 -GlcNAc and (b) K126E 
in complex with coenzyme. The latch of subunit A is depicted as a semi-transparent orange surface. The latch does not 
appear in panel (b) because of dynamic disorder. 
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Catalytic Triad 

• * j - * • ******* ******* ; ******** 

114 AI HQNVKKV I CLSTDKAAY P INAMG I SKAMMEKV 144 

114 AlGNGVKKWCLSTDKAVirP INAMG I SKAMMEKV 144 

117 A I GAGVK R WCLS TDKAV Y P I NAMG I S KAMME KV 147 

114 A I Q C A V S EV I C L S T D K A V Y P I M AMG I S KAMME KV 144 

114 A I AN Q V Q R WCLS TDKAV Y P I NAMG I S KAMME KV 144 

114 A I Q N Q I K R WCLS TDKAV Y P I NAMG I S KAMME K V 144 

117 A I ASGVK R WCLS TDKAA Y P I N AMGM S KALME KV 147 

120 AIKANVKKVICLSTDKAAYPINAMGISKAMMEKV 150 

114 S I E YDVKKV I CLSTDKAAYP I NAMG X SKAMMEKV 144 

114 AIEMGIQKV I CLSTDKAAYP INAMG I SKAMMEKV 144 




Figure S6 Sequence alignment of the region of CapE containing the catalytic triad 

The sequences correspond to the same set of proteins aligned in Figure 6 of the manuscript. 



CapE 

Q9XC61 

A1WA85 

G4IPV8 

C9PJA1 

Q9CM37 

C3WC53 

D7UYE7 

B7HXA7 

K1AJQ2 
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Figure S7 Sequence alignment of the top 55 structural homologues found in the DALI server (only published structures 
were considered) 

The position of the catalytic triad in the SDR family of proteins is highlighted. Only CapE displays a Met at the catalytic 
site. 
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Table SI Polar interactions between UDP-N 3 -GlcNAc and 
CapE 



Number 


Substrate 


Residue [atom] 


Distance (A) 


1 


[02'] 


Glu 257 [0E1] 


2.71 


2 


[02'] 


Glu 257 [0E2] 


3.45 


3 


[03'] 


Val 86 [N] 


2.78 


4 


[04'] 


Lys 84 [0] 


2.80 


5 


[N3] 


Thr 189 [0] 


2.57 


6 


[02] 


Thr 191 [N] 


2.99 


7 


[04] 


Thr 189 [0G1] 


3.30 


8 


[02B] 


Arg 254 [NH2] 


2.79 


9 


[01A] 


Val 174 [N] 


2.84 


10 


[02] 


Thr 191 [0G1] 


3.17 


11 


[02'] 


Thr 191 [0G1] 


2.69 


12 


[01B] 


Lys 126 [NZ] 


2.86 


13 


[02'] 


Thr 191 [0G1] 


2.69 


14 


[03'] 


Gin 85 [0E1] 


3.00 



© 2013 The Author(s) This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC-BY) (http://creativecommons.0rg/licenses/by/3.0/) 
which permits unrestricted use, distribution and reproduction in any medium, provided the original work is properly cited. 



Crystal structure of CapE 



Original Paper 



Table S2 Intermolecular polar contacts between subunits of CapE (A) 

Since CapE is a homomer, and to avoid duplications, only one set of interactions is given. 

(a) 

Number Chain A Chain B Substrate analogue wild-type NADPH wild-type NADPH K126E 

Hydrogen bonds 



1 


Lys 143 [NZ] 


Glu 90 [0E1] 


2.56 


2.69 


2.30 


2 


Lys 143 [NZ] 


lie 131 [0] 


3.13 


- 


- 


3 


Argiso [NH1] 


Glu 90 [0] 


3.30 


3.22 


3.64 


4 


Thr 253 [N] 


Thr 302 [0G1]* 


3.14 


- 


- 


5 


Thr 253 [N] 


Asn 298 [0]* 


3.26 






6 


Thr 253 [0G1] 


Thr 302 [0G1]* 


2.69 






7 


Arg 254 [NH1] 


Val 294 [0]* 


2.86 






8 


His 255 [N] 


Asn 292 [0]* 


2.85 






9 


His 255 [ND1] 


Asn 292 [0]* 


3.28 






10 


His 255 [NE2] 


Gin 271 [0E1]* 


3.29 






11 


Gly 256 [N] 


Glu 267 [0E2] 


2.69 






12 


Arg 266 [NH2] 


Glu 257 [0] 


3.19 






13 


Asn 298 [N]| 


lie 251 [0] 


3.65 






14 


Asn 298 [N]| 


Thr 253 [0] 


2.67 






alt bridges 












1 


Lys 143 [NZ] 


Glu 90 [0E1] 


2.64 


2.69 


2.30 



(b) 

Number Chain A Chain F Substrate analogue wild-type NADPH wild-type NADPH K126E 



Hydrogen bonds 



1 


Arg 37 [NH1] 


Arg 37 [0] 




3.05 


3.38 


2 


Lys 84 [N] 


Glu 39 [0E1] 


2.81 


2.81 


2.90 


3 


Gin 85 [N] 


Glu 39 [0E2] 


2.68 


2.75 


2.95 


4 


Ser 88 [OG] 


Glu 39 [0E2] 


3.02 


2.92 


2.88 


5 


Lys 98 [NZ] 


Asp 63 [0D2] 


2.63 


2.52 


2.68 


6 


Glu 95 [0E2] 


Ser 66 [OG] 


2.52 






7 


Arg 37 [NH2] 


Thr 99 [0G1] 


3.79 






8 


Arg 62 [NH2] 


Asp 60 [0D2] 






3.46 


alt bridges 












1 


Arg 37 [NH1] 


Asp 60 [0D2] 


3.11 






2 


Arg 37 [NH2] 


Asp 60 [0D1] 


2.81 






3 


Lys 98 [NZ] 


Asp 63 [0D2] 


2.63 


2.52 


2.68 


4 


Arg 62 [NH2] 


Asp 60 [0D2] 






3.46 



* Residues belonging to the latch are disordered in the two structures of CapE with NADPH-bound. 
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Table S3 Activity of wild-type CapE and muteins 

N.D., no product was detected by HPLC; N.A., not applicable because of low conversion. 



Protein 


Position of mutation 


Absorbance ratio 280 nm/260 nm 


Conversion 


Product ratio 


Wild-type 




0.95 


97.6±2.3 


2.7±0.2 


D125A 


Active site 


1.12 


14.5 ±5.6 


N.A. 


K126A 


Active site 


1.00 


N.D. 


N.A. 


K126E 


Active site 


1.05 


N.D. 


N.A. 


M134A 


Active site 


1.11 


27.0±3.7 


N.A. 


E257A 


Active site 


0.96 


N.D. 


N.A. 


F91A 


A-B surface 


1.00 


77.3±3.4 


1.5±0.1 


F92A 


A-F surface 


1.16 


17.3 ±2.8 


N.A. 


Y290A 


Latch 


1.00 


75.5±2.1 


0.4±0.05 


Y293A 


Latch 


1.09 


97.6±0.6 


0.7 ±0.07 


Y305A 


Latch 


1.10 


86.0 ±1.6 


0.3 ±0.02 


Y307A 


Latch 


0.98 


85.4 ±3.4 


0.8 + 0.1 
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